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Integrin a5pl is a major cellular receptor for the extra- 
cellular matrix protein fibronectin and plays a funda- 
mental role during mammalian development. A crystal 
structure of the a5pl integrin headpiece fragment bound 
by an allosteric inhibitory antibody was determined at 
a 2.9-A resolution both in the absence and presence 
of a ligand peptide containing the Arg-Gly-Asp (RGD) 
sequence. The antibody-bound (31 chain accommodated 
the RGD ligand with very limited structural changes, which 
may represent the initial step of cell adhesion mediated by 



nonactivated integrins. Furthermore, a molecular dynam- 
ics simulation pointed to an important role for Ca^* in the 
conformational coupling between the ligand-binding site 
and the rest of the molecule. The RGD-binding pocket is 
situated at the center of a trenchlike exposed surface on 
the top face of a5|3l devoid of glycosylation sites. The 
structure also enabled the precise prediction of the accep- 
tor residue for the auxiliary synergy site of fibronectin on 
the a5 subunit, which was experimentally confirmed by 
mutagenesis and kinetic binding assays. 



Introduction 

Integrins are cell adtiesion receptors that transmit bidirectional 
signals across the plasma membrane and link the extracellular 
environment to the actin cytoskeleton (Hynes, 2002). All inte- 
grins are noncovalently linked heterodimeric molecules consist- 
ing of one a and one (3 subunit, in which both subunits are 
required to create a functional binding site at the membrane- 
distal part of the cell surface receptor for specific extracellular 
ligands. Compared with other cell adhesion receptor classes, 
integrin' s ligand recognition mechanism is highly unique in three 
aspects. First, the ligand recognition specificity of each integrin 
heterodimer is determined combinatorially, in that both the 
a and the p subunits contribute to the selective ligand binding 
of the resultant heterodimeric receptor, and even the same 
P (or a) subunit will bind different ligands when paired with a 
different a (or p) subunit (Hynes, 2002). Second, in contrast to 
other divalent cation-dependent cell adhesion molecules such 
as cadherins in which metals do not directly bridge two mole- 
cules across the cell-cell junction (Patel et al., 2006), the core 
mechanism of integrin-ligand recognition involves a direct co- 
ordination bond between an Mg^* bound on the integrin (called 
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the metal ion-dependent adhesion site [MIDAS]) and acarboxy- 
late oxygen from the ligand. Lastly, the ligand-binding affinity 
of integrins can be modulated allosterically via conformational 
changes that take place outside the binding pocket (Carman and 
Springer, 2003). 

The determination of crystal structures of p3 integrin 
ectodomain fragments have contributed enormously to our 
understanding of the integrin-Ugand interactions (Xiong et al., 
2001, 2002, 2009; Xiao et al., 2004; Springer et al., 2008; Zhu 
et al., 2008, 2010). Specifically, the structures of aVpS and 
allbp3 integrins in complex with their cognate peptide ligands 
represented by the Arg-Gly-Asp (RGD) sequence revealed 
how the small tripeptide portion is specifically recognized by 
integrins using a small binding cleft at the subunit interface 
and how the ligand binding is associated with the transition 
from the closed or low-affinity conformation to the open or 
high-affinity conformation of integrin. However, there remain 
important unanswered questions. For example, lack of an atomic 
resolution structure of integrin in complex with a protein ligand, 
which usually bears both a core binding motif such as RGD and 
a secondary synergy site, precludes the complete understanding 
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of the basis for the physiological binding events. Also, two 
different conformations of (33 integrin were found in the ligand- 
bound state, leading to a controversy over the structural path- 
way that leads to the physiological activation/ligand binding 
for integrins. Another important issue is whether the same 
ligand recognition and affinity modulation mechanisms apply 
for integrins outside the p3 class. It is particularly important 
to obtain structural information about (31 integrins because 
they constitute the largest and probably the most ancient inte- 
grin subclass (Brower et al., 1997) and are fundamentally in- 
volved in mammalian development. 

We report herein the crystal structure of a ligand-binding 
fragment of human a5(31 integrin, a prototypic integrin that 
functions as an RGD-dependent fibronectin receptor. The struc- 
ture, solved as a complex with a Fab fragment of the anti-(31 
inhibitory antibody SG/19, revealed high similarity to the ligand- 
unbound form of aVp3 and allb(33 integrins. Surprisingly, the 
RGD peptide can be introduced into the binding pocket by 
soaking, without causing any conformational change in integrin 
except for an ^1-A shift of one residue and the dissociation of 
Ca^* from the adjacent to the MIDAS (ADMIDAS). Docking 
simulations and structure-based mutagenesis identified a single 
a5 residue responsible for the strong preference of a5(31 for 
fibronectin, establishing a basis for the combinatorial roles played 
by each subunit during the specific recognition of protein ligands. 
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Results 

Despite extensive efforts, our initial attempts to crystallize 
the full-length a5(31 ectodomain fragment were unsuccessful. 
When we imaged various recombinant soluble integrins using 
negative-stain EM, we realized that the conformation of a5(31 
was highly heterogeneous as a result of its flexible lower half 
(unpublished data), whereas full-length aVpS integrin showed 
uniformly compact bent conformation reminiscent of the crys- 
tal structure (Takagi et al., 2002). We reasoned that this con- 
formational heterogeneity was to blame for the difficulty in 
crystallizing full-length a5(31 and thus focused on the truncated 
fragment devoid of the lower half (referred to hereafter as the 
headpiece fragment). A previous EM study showed that SG/19 
binds at the junction between the pA and the hybrid domains 
of (31 subunit, thereby fixing the conformation of the hybrid 
domain, which is highly mobile in the context of the truncated 
fragment (Luo et al., 2004). We took advantage of this phenom- 
enon to crystallize the a5pl integrin headpiece fragment by 
making a complex with SG/19 Fab in the presence of physio- 
logical concentrations of Mg^* and Ca^"^. The structure, refined 
at a 2.9-A resolution, contained all of the domains present in 
the constructs, including the p-propeller and thigh domains of 
a5 subunit, the plexin/semaphorin/integrin (PSI), hybrid and (3A 
domains of (31, and the SG/19 Fab (Fig. 1 A; see also a rocking 
video in Video 1). Two a5(31-Fab complexes were contained 
in one crystallographic asymmetric unit and were essentially 
identical except for minor differences in the interdomain angles 
at the lower half of the molecule (Fig. SI). Therefore, we de- 
scribe the structure of one complex (chains A, B, E, and F) in 
the following discussion. 



Figure 1. Structure of tfieaS^I integrin headpiece in complex with SG/ 1 9 
Fob. (A) Ribbon presentation of the overall structure with disulfide bonds in stick 
model. Individual domains are differently colored in magenta (p-propeller), 
red (thigh), cyan (pA), blue (hybrid), and purple (PSI), and bound metal 
ions are shown as spheres (yellow for Co^* and purple for Mg^*). SG/1 9 Fab 
is colored in gray (light chain) and wheat (heavy chain), with their CDR 
loop regions highlighted in green and yellow, respectively. (B) Close-up 
view of the SG/1 9-binding interface. SG/1 9 Fab and pi chain ore shown 
in surface and ribbon presentations, respectively, with the same color code 
used in A. CDR regions and important interface residues are labeled. 
(C) Superposition of three integrin headpiece structures in the ligand- 
unbound form. Blue, a5pl; red, aVp3 (SUE); green, allbpS (3FCS). On 
the right is a blowup of the region around the metal-binding sites in the pi 
(blue) and p3 (green; 3FCS) pA domains. 

The allosteric inhibitory antibody SG/19 makes extensive 
interactions with the long loop between p strands X and A (fol- 
lowing the strand nomenclature by Xiong et al. [2001]) of (31 
hybrid domain using its antigen-combining sites (Fig. 1 B). The 
species-specific Thr82 that had been identified as the epitope 
for SG/19 was in fact deeply inserted into the pocket formed by 
complementarity-determining region (CDR) loops HI, H2, H3, 
and L3. Furthermore, SG/19 also interacts with pA domain res- 
idues N151, R154, and R155 using its L2 loop outside the CDR 
core, anchoring the lower half of the al helix of pA domain. 
Because of this two-sided interaction, the H3 loop is wedged 
between pA and hybrid domains and prevents the outward 
swing of the hybrid domain. This resulted in the closed head- 
piece conformation with the tucked hybrid domain, similar to 
the ligand-unbound, low-affinity form of the p3 integrins. The 
similarity in the overall conformation can readily be appreciated 
upon structural superposition of the a5pl headpiece onto the 
closed avp3 (3UE, 1 .92-A root-mean-square deviation [RMSD] 
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a5 218 QTRQAS-SIYDDSY 230 

aV 209 ATRTAQ-AIFDDSY 221 

allb 221 LSFDSSNPEYFDGY 234 

a8 216 QTEVAP-ASYDDSY 228 

Figure 2. Recognition of RGD peptide by aSpl . (A and B) Close-up views 
of the region surrounding MIDAS before (A) and after (B) ligand peptide 
soaking. Fo-Fc omit maps, in which RGD, metals, and pi residues Tyrl 33- 
Metl 35 were removed from the models, are shown in magenta wireframes 
at a contour level of 3 a. For reduction of model bias from the omitted 
atoms, the partial models were subjected to refinement before phase cal- 
culation. Note that the electron density corresponding to ADMIDAS Ca^* 
disappeared in the RGD-bound structure. (C) Specific recognition of RGD 
peptide by three different classes of integrins: aSpl (top], aVp3 (1L5G; 
middle), and allbpS (2VDR; bottom). Hydrogen and coordination bonds 
ore shown in dashed lines. Metal ions in the pA domain are shown in 
spheres. (D) Sequence alignment around the Arg acceptor residues of 
RGD-dependent a subunits. Head-on and side-on binding residues ore 
highlighted in pink and yellow, respectively. 

for 864 Ca atoms) and allbpS (3FCS, 1.52-A RMSD for 820 
Ca atoms; Fig. 1 C). 

The three metal-binding sites found in |33 integrin were 
also present in pi, and we could see strong electron density at 
all sites (Fig. 2 A and Video 2), indicating that they were 
occupied. Based on the coordination geometries and similarity 
to the previously determined structures of p3 integrins, we as- 
signed Mg^* to the MIDAS and Ca""^ to the two flanking sites, the 
ligand-associated metal-binding site (LIMBS; also called SyMBS) 
and ADMIDAS. The correctness of the metal assignments was 
also supported by an experiment using a crystal treated with Mg^"^ 
and EGTA, in which electron density for MIDAS remained 
unchanged, whereas that of ADMIDAS disappeared (unpub- 
lished data). Density for LIMBS remained after this treatment, 
but the completely buried nature of this ion may have prevented 
the access of EGTA. The coordination environment for MIDAS 
and ADMIDAS ions in the current complex was essentially 
identical to that of ligand-free p3 (Fig. 1 C). Therefore, binding 
of SG/19 not only induced the overall closed conformation of 
the headpiece (i.e., tucked hybrid domain) but also stabilized 
the low-affinity MIDAS configuration. 

A previous kinetic study indicated that SG/19 attenuates, 
but not abolishes, the Ugand-binding capability of Mn"*-activated 







Mn 

no mAb 


. CaVMg 

200 nM^-^ 






100 iiM^l 






if SO|iM_ — 




1 +SG/19 


Time (s) 


+mAb 16 
1 1 


1 1 



0 50 100 150 

Time {s) 

Figure 3. Surface plosmon resonance binding analysis of the aSpl head- 
piece fragment to Fn7-10. Purified and unclasped a5pl headpiece frag- 
ment at a concentration of 1 00 nM was flowed for 60 s over the surface of 
immobilized Fn7-10 ('^1,400 resonance units [RU]) in a buffer containing 
1 mM MnCl2. The integrin was preincubated with SG/19 Fab (10-fold 
molar excess; dotted line) or mAb 1 6 IgG (fourfold molar excess; gray line) 
or left untreated (solid line), (inset) Concentration-dependent binding sen- 
sorgroms of the same unclasped a5pl headpiece fragment onto Fn7-10 
in a buffer containing 1 mMCaCl2and 1 mM MgCl2. This experiment was 
completed once. 

full-length aSpl integrin (Luo et al., 2004). We confirmed this 
finding in surface plasmon resonance assays using the truncated 
headpiece fragment of a5pl (Fig. 3). Binding of the unclasped 
aSpi headpiece fragment to the fibronectin ligand (Fig. 3, solid 
line) was greatly reduced but not completely abolished when 
bound by SG/19 (Fig. 3, dotted line), whereas complete inhibi- 
tion was achieved by anti-a5 direct blocking antibody 16 (Fig. 3, 
gray Une). This suggests that the SG/19-bound a5pl integrin may 
become ligand occupied in the presence of a high-concentration 
ligand. We also confirmed that the headpiece fragment used in 
this study was capable of binding ligand in the divalent cation 
condition used in the crystallization (i.e., 1 mM Ca^* and 1 mM 
Mg~*; Fig. 3, inset), although the binding was much weaker than 
that in the presence of Mn^*. 

In line with the aforementioned expectation, soaking of 
the SG/19-a5pl complex crystal in a solution containing 1 mM 
hexapeptide GRGDNP resulted in an appearance of electron 
density above the MIDAS Mg~*, which we interpret as an RGD 
tripeptide segment (Fig. 2 B and Video 3). As the electron den- 
sity of the ligand was weaker than the integrin part, we suspect 
that the occupancy of the peptide was not 100%, and the resul- 
tant structure, refined at 2.9 A, may represent the mean of 
ligand-bound and -unbound forms. Nevertheless, overall struc- 
ture of the RGD-bound a5pl was very similar to that of ligand- 
free integrin (Fig. S2 and Video 4), which is consistent with the 
fact that the soaking did not change the diffraction quality of the 
crystal, nor did it affect the unit cell dimensions (Table 1). In 
the binding pocket, the Arg side chain makes hydrogen bonds 
with a5 residues Q221 and D227, and the Asp carboxylate 
directly coordinates the pi MIDAS Mg^*, recapitulating the 
RGD-binding mode found in p3 integrins (Fig. 2 C). In the 
aVp3-RGD complex structure (Xiong et al., 2002), the Arg of 
RGD is hydrogen bonded to aV D218 from the side, whereas in 
the aIIbp3-RGD complex, it is hydrogen bonded to allb D224 
head on (Fig. 2 C; Springer et al., 2008). The Arg recognition 
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Table ] . Data collection and refinement statistics 
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mode seen in a5 (31 is a mixture of these, with both side-on 
(D227) and head-on (Q221) hydrogen bonds present. Sequence 
alignment around this region suggests that a8, another RGD- 
dependent integrin a subunit, has the side-on residue only (i.e., 
D225) and thus resembles aV (Fig. 2 D). As a5 is predicted to 
be the most ancient among the four (Hynes and Zhao, 2000), the 
head-on residue may have been lost during evolution to yield 
aV and a8 and then reappeared in allb with a concomitant loss 
of the side-on residue. In any case, the basic strategy for RGD 
recognition, in which the extended tripeptide portion is held by 
a two-point anchor situated at the well-shaped binding pocket at 
the subunit interface, seems to be shared among all integrins, as 
predicted earlier (Takagi, 2007). 

The structure of RGD-bound ctSpl was remarkably simi- 
lar to that of the unliganded one, showing an RMSD of merely 
0.27 A for all Ca atoms (Fig. S2). In aV|33 and allbp3 inte- 
grins, binding of RGD or related ligands is accompanied by rear- 
rangements of al and oO helices and nearby segments in the 
pA domain, converting the MIDAS configuration to a high- 
affinity state that is characterized by a direct coordination of the 
last Ser residue in the conserved DXSXS motif to Mg^"^ (Figs. 4 B 
and S3; Xiong et al., 2002; Springer et al., 2008). In a5|31, such 
structural rearrangements are largely prohibited by the bound 
SG/19, resulting in a surprisingly small RMSD of 0.19 A 
between (JA domains of unliganded and RGD-bound forms 
(Fig. 4 A). Nevertheless, Serl34 moves ^1 A toward MIDAS 
to directly coordinate Mg^*, resulting in the high MIDAS con- 
figuration (Figs. 4 A and S3). This indicates that the direct 
coordination of the ligand carboxylate and the Ser hydroxyl to 
MIDAS Mg^* is internally coupled and can be accomplished 
even in the absence of the coordinated movements of the a 1 and 
olI helices. Similar rearrangement in the MIDAS coordination 
chemistry unaccompanied by a 1 and al helix movements dur- 
ing activation has been recently reported for the aA domain of 



the al subunit (Lahti et al., 2011). In addition to the change 
in MIDAS configuration, there was another critical change in- 
duced by RGD binding; the electron density corresponding to 
the ADMIDAS Ca^* was diminished in the RGD-soaked crystal 
(Figs. 2 B and S3 and Video 3), whereas that of LIMBS, MIDAS, 
and all other metals bound to the a5 subunit remained unchanged. 
This ADMIDAS-specific Ca^+ discharge seems inevitable be- 
cause the two pA residues that shifted most upon RGD binding, 
SI 34 and A342 (Fig. 4 A), provided their backbone carbonyl 
to ADMIDAS coordination. 

Very limited but characteristic changes in the 3 1 structure 
upon RGD binding prompted us to perform molecular dynam- 
ics (MD) simulations to see whether these changes can be repro- 
duced in silico. To focus on the local environment sun^ounding 
the ligand-binding pocket and the metals, only the region within 
20 A from the Serl32 was subjected to the calculation (Fig. 5 A). 
In this condition, residues that are anchored by SG/19 were lo- 
cated outside the simulated region, in effect mimicking the 
conformational freezing by SG/19. First, 10 independent MD 
simulations (40 nanoseconds each) were performed on the 
ligand-unbound structure in which all the metal sites were 
occupied. The coordination environment of ADMIDAS remained 
largely undisturbed, and the Ca^* moved from its original posi- 
tion in only 1 out of 10 simulations (Fig. 5 B, top). We then per- 
formed another set of simulations on the same structure after 
manually placing the RGD peptide at the binding pocket above 
the MIDAS Mg^+ to mimic the very first step of the ligand en- 
counter. In this condition, Ca""^ at the ADMIDAS site became 
mobile and was expelled from the site in 5 out of 10 simulations 
(Fig. 5 [B and C] and Video 5). In contrast, both the MIDAS 
and LIMBS metals were highly stable and did not show major 
displacement from the original position. During the time frame 
of the simulation, the Ca^* did not completely diffuse away 
from the site but remained attached to integrin through hydrogen 
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Figure 4. Shape shifting of the |SA domain upon ligand binding. 

(A) Structural superposition of the MIDAS region of pi before (blue) and after 
(yellow) the binding of ligand. (B) Structural superposition of the MIDAS 
region of (33 before (3FCS; green) and after (2VDR; red) the binding of 
ligand. In both A and B, al helix is colored darker than the rest of the 
nnolecule, and the structural differences between the ligand-free and RGD- 
bound forms (plot of distances between Ca otonns in each (3A domain) are 
shown at the bottom. Locations of the al and oJ helices are denoted by 
block horizontal bars. 



bonds with D137 and D138, causing a large disturbance in the 
conformation of the pi-al loop containing these residues (and 
S134). To follow the complete dissociation process of Ca^"^, 
much longer simulation (greater than a microsecond) and a 
more rigorous force field would be required. Therefore, the MD 
result is largely consistent with the ADMIDAS-specific metal 
discharge upon ligand binding, which was observed in the crys- 
tal of SG/19-bound low-affinity aSpl. Next, we focused on the 
movement of SI 34. Unlike in the crystal, significant movement 
of SI 34 toward Mg"* was not observed in the RGD-bound 
structure, and the bond remained a water-mediated indirect 
coordination with a mean distance of 4.48 A (Figs. 5 D [-I-RGD] 
and S4 and Video 6). In the absence of the ROD ligand, how- 
ever, the loop harboring SI 34 exhibited a high degree of mobil- 
ity, and SI 34 tended to move away from the Mg^*, and the 
distance between them frequently exceeded 7 A (Figs. 5 D 
[ligand-free] and S4 and Video 7). These results indicate that 
ligand binding involving the direct coordination of the ligand 
carboxylate to Mg^* acts favorably for the approach of SI 34 
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Figure 5. Effect of ligand binding on the conformation of the MIDAS/ 
ADMIDAS region studied by MD simulations. (A) Illustration of the simulat- 
ing system with GSBP. The region inside the sphere with a radius of 20 A 
centered on Serl32 (red dotted circle) was allowed to undergo explicit 
dynamics, whereas atoms in the rest of the system are taken into account 
implicitly. (B) Occurrence of the ADMIDAS dissociation in 10 indepen- 
dent 40-nanosecond simulations with and without RGD. The ADMIDAS 
Ca'* was considered to have escaped when the three interactions holding 
ADMIDAS (Ca2*...Serl34, Ca2*...AIa342, and Aspl 38...Ser344) were 
all broken and when the distance between MIDAS Mg'* and ADMIDAS 
Ca'* became >11 A. Each trajectory is divided into 10 bins (4 nano- 
seconds for each), and the dissociation state for each bin is indicated 
on the vertical axis (state = 1 means that the event occurrence exceeded 
50%). (C) Example of the ADMIDAS escape. Structural snapshots before 
(0 nanoseconds; top] and after (40 nanoseconds; bottom) the MD simulation 
in trajectory no. 10 of +RGD simulation reveals loss of two coordination 
bonds (black dotted lines) and on increase in the distance between MIDAS 
and ADMIDAS (red dotted lines). See also Video 5. (D) Stabilization of the 
water-mediated coordination of SI 34 to Mg'* by RGD binding. Distances 
between the of SI 34 and the MIDAS Mg'* (inset) were traced over 
time. Representative tracings for simulations in the presence (-fRGD, trajec- 
tory no. 8; see also Video 6) or the absence (ligand-free, trajectory no. 9; 
see also Video 7) are shown. See Fig. S4 for tracings of all trajectories. 



toward MIDAS, which is in agreement with the conformational 
shift that occurred during the RGD soaking experiment. Fur- 
thermore, the failure of MD to reproduce the switch from the 
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Figure 6. A model of the Fn9-10-a5(31 complex. The no. 10 molecule 
from the mouse Fn9-10 fragment nuclear magnetic resonance structures 
(Protein Data Bank accession no. 2MFN) docked onto the a5pl using the 
bound RGD peptide (green stick model) as a guide is shown as a magenta 
ribbon model, with the Argl379 side chain highlighted. Carbohydrate 
moieties are shown by modeling GlcNAc2Man5 residues (space-filling 
model) at the tip of Asn residues. a5 and (31 chains are surface rendered 
in wheat and gray, respectively, except for the three acidic residues (red) 
and MIDAS Mg^* (blue). The trenchlike exposed surface is marked by 
black dotted lines. 



indirect to direct coordination of SI 34 suggests that the high- 
affinity MIDAS configuration may not necessarily be Ugand 
induced but could be ligand stabilized. 

The structure of the aforementioned RGD-a5pl com- 
plex can be used to simulate how the protein ligand (i.e., fibro- 
nectin) binds to integrin. As shown in Fig. 6, the RGD-binding 
pocket is surrounded by several A^-glycan chains, leaving a 
trenchlike exposed surface along the subunit interface. This re- 
sults in a limited choice of docking orientation when the elon- 
gated fibronectin molecule tries to make close contact. Simple 
docking of 30 nuclear magnetic resonance structures of core 
integrin-binding fibronectin fragments (Fn9-10) onto aSpi using 
the RGD segment as a guide resulted in only a handful of com- 
plex models compatible with both structures. Fig. 6 shows one 
of the most successful models, in which the Fn9 module ap- 
proaches the a5 subunit. In this model, the residue known to 
be most important for the synergy activity of the fibronectin 
(Argl379) is pointing toward the negatively charged surface 
created by Glu81, Glul24, and Asp 154 of a5, which is likely 
to constitute the basis for the synergy effects. We individually 
mutated these residues to Ala and prepared recombinant solu- 
ble a5pl ectodomain fragments carrying these mutations and 
tested their ability to bind fibronectin. In the solid-phase bind- 
ing assay, wild-type (WT) integrin aSpi showed ^ 10-50-fold 
lower affinity toward the fibronectin fragment lacking the three 
synergy residues (R1374, P1376, and R1379) than the WT fibro- 
nectin fragment (Fig. 7 A, top left graph; Takagi et al., 2003). 
This synergy site dependency was also observed with mutant 
a5pl carrying either the E81A or the E124A mutation (Fig. 7 A), 
indicating that these residues are not required for the synergy 
site recognition. In contrast, the D154A integrin mutant could no 
longer distinguish between WT and mutant fibronectin (Fig. 7 A, 
bottom right graph), suggesting the critical importance of this 
residue for the synergy effect. Similar results were obtained in 
the surface plasmon resonance analysis using immobilized 
Fn7-10 and solution-phase integrin (Fig. 7 B). As a single resi- 
due mutant of Fn7-10 (i.e., R1379A) was used in this experi- 
ment instead of the triple mutant used in the solid-phase assay. 
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Figure 7. Determination of the synergy site acceptor residue on a5 sub- 
unit. (A) Solid-phase equilibrium binding using immobilized full-length 
a5(3l ectodomain and solution-phase biotinylated Fn7-10. Binding ex- 
periments with WT (closed squares) or Rl 374A/P1 376A/R1 379A mutant 
(open circles) Fn7-10 fragments to four different a5pl preparations were 
performed, and data from one representative of three independent experi- 
ments are shown. (B) Effect of synergy residue mutations on the binding 
kinetics studied by surface plasmon resonance analysis. WT (solid lines) 
and R1379A mutant (dotted lines) biotinylated Fn7-10 were immobilized 
separately onto a streptavidin-coated chip ('^1 ,000 resonance units [RU]). 
The surface was infused for 60 s with either WT (left) or D154A-purified 
(right) full-length recombinant aSpl integrin at 20 nM in a running 
buffer containing 1 mM Mn^*, and the dissociation phase was followed 
for 100 s. The data are from a single representative experiment out of 
three repeats. 



we conclude that Asp 154 of a5 and Argl379 of fibronectin 
constitute the major interacting pair, as predicted from the 
docking simulation. 

□iscussion 

p 1 integrins constitute the largest integrin subfamily and recog- 
nize a wide range of ligands depending on the partner a subunit. 
a5pi was one of the first integrin heterodimers to be identified, 
and its function as a fibronectin receptor has been studied exten- 
sively (Tamkun et al., 1986). Especially, the discovery of the 
site on fibronectin important for binding a5pl integrin (syn- 
ergy site) has set the basis for our general understanding of the 
integrin-ligand recognition mechanism, in which the combina- 
tion of a core interaction (e.g., RGD site) with a secondary inter- 
action (e.g., synergy site) ensures both high affinity and specificity 
(Obara et al., 1988; Aota et al., 1994). It is known that R1379 
makes the greatest contribution to the synergy effect on the 



1 3B JCB • VOLUME 1 B7 • NUMBER 1 • 201 S 



fibronectin side (Redick et al., 2000). Through structure-guided 
mutation experiments, we have now successfully identified the 
specific residue (Asp 154) that distinguishes a5 from other a 
subunits and results in its strong preference for fibronectin over 
other RGD ligands. 

Binding between a5pl and fibronectin has been exten- 
sively studied through biochemical, immunochemical, and cell 
biological approaches (Humphries, 2000), and most of those re- 
sults can be rationalized by the current structure. For example, 
the epitope for the anti-a5 antibody mAb 16 has been mapped 
to El 16/Ll 18 that sits next to the RGD-binding pocket, which is 
in line with the strong and direct function-blocking ability of this 
antibody (Burrows et al., 1999). Leu212, the epitope residue for 
another function-blocking antibody P1D6, is located adjacent to 
D154, explaining why this antibody preferentially blocks syn- 
ergy site interaction (Mould et al., 1997). Ala mutation of 1210 
was reported to severely compromise the interaction with the 
synergy region (Mould et al., 2003b). As the side chain of 1210 
makes van der Waals contact with D 154 to orient its side chain 
outward, the I210A mutation may indirectly affect the synergy 
site binding by partially burying the D154 carboxylate. Also, it 
has been reported that W 157 is responsible for the a5 subunit's 
preference for the RGDGW peptide (Humphries et al., 2000). 
The side chain of W157 rises on one side of the RGD-binding 
pocket, and it is likely that the bound RGDGW peptide makes a 
reverse turn at the second Gly to allow the Trp-Trp ring packing. 
In addition to providing these fine residue-wise structural in- 
sights into the ligand recognition mechanism, the structure offers 
a unique view of how the ligand-binding specificity might be 
regulated by sugar chains. In contrast to the 33 chain, which is de- 
void of A^-glycosylation sites on the top face of the pA domain, 
pi possesses two conserved 7V-glycosylation sites near the Ugand- 
binding site (Fig. 6). These glycans, together with two more gly- 
cans attached to a5, would occupy considerable space because 
of their conformational flexibility. Such a canopy may function 
to selectively capture only the ligands that have the RGD se- 
quence at an apex of a protruding loop (Leahy et al., 1996; Copie 
et al., 1998) or to restrict the preferential orientation of transient 
docking of fibrillar ligands so that the RGD site along the fiber 
can be scanned efficiently. As differences in the glycosylation 
state of a5pl on the cell surface have been reported to affect the 
biological function of the receptor (Scales et al., 2005; Isaji et al, 
2006, 2009), it would be valuable to test these possibilities in a 
cell-based system. 

A previous EM study predicted that SG/19 exerts its in- 
hibitory effect allosterically by preventing the hybrid domain 
swing-out in pi (Luo et al., 2004), and the current study con- 
firmed the lack of direct occlusion of the fibronectin-binding 
surface by this antibody. The crystal structure also revealed that 
the binding of SG/19 not only restricted the global domain ori- 
entation but also fixed the local conformation of the pA domain 
by directly binding to the lower half of the al helix. In p3, this 
helix moves inward to push the a7 helix during the transition to 
the high-affinity state, resulting in the approach of ADMIDAS 
toward MIDAS (Fig. 4 B; Xiong et al., 2002; Luo and Springer, 
2006). We predict that a similar conformational change can take 
place in pi during the ligand binding, as suggested earlier by 



Mould et al. (2002), if the al helix is not frustrated by SG/19. 
As this inward movement of the al helix was observed in all 
ligand-bound p3 structures reported so far regardless of the 
conformation of the hybrid domain, it is likely that the SG/19 
exerts its inhibitory action primarily through the al helix rather 
than the hybrid domain. In fact, epitopes for a series of anti- 
chicken pi antibodies with function-blocking activity have 
been mapped exactly to this region (Shih et al., 1997), suggest- 
ing an intimate relationship between the ligand binding and the 
conformational freedom of the al helix. Although there is no 
anti-human p 1 antibody mapped to this region as a result of the 
sequence conservation between human and mouse pi, there are 
many function-modulating (i.e., inhibitory and activating) anti- 
bodies mapped to the neighboring a2 helix (Takada and Puzon, 
1993; Luo et al., 2004). These antibodies are likely to recognize 
residues in al helix as well, just like in the case of SG/19. There- 
fore, the segregation of epitopes for many function-modulating 
antibodies in this region suggests that this region exists in at 
least two alternative conformations (i.e., high- and low-affinity 
forms) and that each antibody affects the ligand-binding ability 
depending on the conformer it stabilizes. 

High-resolution structures of the allbp3 headpiece re- 
vealed that the p3 MIDAS always assumes octahedral geome- 
try, with the top axial coordination site reserved for a water or 
ligand carboxylate. The configuration of the MIDAS metal 
can be grouped into two types that differ by only one coordi- 
nation site, one with a direct coordination from the side chain 
of the last Ser residue of the DXSXS motif (MIDAShigh con- 
figuration) and the other with the same bond replaced with a 
water-mediated indirect coordination (MIDASiow configuration; 
Fig. S3). In P3 integrins, binding of a ligand or a ligand-mimetic 
analog is tightly coupled with the MIDAShigh configuration 
(Zhu et al., 2010). Here, we showed that the same coupling 
exists in pi integrin. Our MD simulation data also hinted an 
energetic linkage between the two coordination bonds. Fur- 
ther quantum mechanical calculations would reveal the pre- 
cise mechanism of the coupling. 

As we used the allosteric inhibitory antibody SG/19 to 
minimize the interdomain flexibility intrinsic to a5pl integrin 
to facilitate crystallization, the RGD-bound a5pl structure we 
obtained must be regarded as somewhat artificial. Nevertheless, 
this procedure may have served to visualize a structural state 
corresponding to the very first step of physiological ligand bind- 
ing by integrins on the resting cell surface. Thus, a ligand can 
approach the resting integrin with MIDASiow to produce transient 
ligand docking. The ligand-bound MIDASiow would quickly 
be converted to MIDAS^gh by pulling the al-pi loop (and the 
Serl34 therein) closer, together with the Ca^^'-loaded ADMIDAS. 
In our crystal structure, however, only the SI 34 (and the fol- 
lowing two residues) are forced to move toward Mg^* to fulfill 
the requirement for MIDAS^gh configuration because full-loop 
movement is prohibited by SG/19 anchorage of the C terminus 
of the al helix. At the same time, this movement destabilized 
the coordination environment of the ADMIDAS, resulting in 
the release of Ca^*. 

Although the physiological importance of ADMIDAS in 
integrin function is well appreciated, the exact role played by 
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this metal remains somewhat controversial. Mould et al. (2003a) 
reported that mutation to ADMIDAS in aSpl resulted in a 
diminished ligand-binding ability. They also observed that the 
mutant became resistant to the artificial activation by an activat- 
ing antibody, TS2/16, and that activation by Mn^"^ no longer 
exposed the activation epitope. From these observations, the 
authors concluded that ADMIDAS was critical in maintaining 
the active conformation of the receptor. On the other hand, 
Chen et al. (2003) found that a similar mutation in the ADMIDAS 
of a4p7 integrin expressed on the cell surface canceled the de- 
fault low-affinity state of the integrin, resulting in the overall 
up-regulation of cell adhesion. This led to a proposal that the 
ADMIDAS represents a negative regulatory site for integrin 
function. These seemingly contradicting results can be recon- 
ciled if we take a view that the function of ADMIDAS is to 
physically link the change in the ligand-MIDAS bond to the 
change elsewhere in the molecule, particularly the hybrid 
domain via the al helix. Thus, the mutation of ADMIDAS resi- 
dues would decouple the conformational relay in both direc- 
tions, resulting in either activation or deactivation of a particular 
integrin depending on its default affinity state. 

There has been a debate over how the integrin conforma- 
tion and the ligand binding are mutually related. Particularly, 
the hypothesis that a global conformational change (i.e., a 
switchblade-like integrin extension) is linked to the local con- 
formational up-regulation of the ligand-binding MIDAS has 
been challenged by Adair et al. (2005), who observed a com- 
pact (bent) conformer of aVpS upon fibronectin binding. This 
discrepancy can be reconciled with the current structure because 
it unequivocally showed that even an integrin permanently main- 
tained in a low-affinity conformation by an inhibitory antibody 
is capable of ligand binding via the high-affinity form of MIDAS . 
In addition, we have recently found, using EM, that (31 inte- 
grins do not assume acutely bent conformation under the non- 
activating condition (unpublished data). It is possible that the 
ligand binding and the overall integrin conformation are less 
tightly coupled than originally thought, especially in pi inte- 
grins. Nevertheless, the existence of local conformational cou- 
pling between the ligand-binding pocket and the a 1 helix-hybrid 
domain region in p subunit is supported by an overwhelming 
number of studies (Mould et al., 2002; Xiong et al., 2002; Xiao 
et al., 2004; Zhu et al., 2010). Exactly how this conformational 
relay system, exerted mainly by ADMIDAS, contributes to the 
inside-out and outside-in integrin signal transduction needs 
to be explored in more detail using structural as well as cell bio- 
logical analysis. 

Materials and methods 

Preparation of the aSpl integrin headpiece and SG/19 Fab fragment 

The expression construct for the a5 subunit contained residues 1-623 fol- 
lowed by a 30-residue ACID-Cys peptide, and the construct for pi con- 
tained residues 1-445 followed by a Tobacco etch virus (TEV) protease 
recognition sequence, a 30-residue BASE-Cys peptide, and a hexohisti- 
dine tag. When combined, the C-terminal ACID-Cys and BASE-Cys seg- 
ments form an intersubunit disulfide-bridged a-helical coiled-coil (called 
clasp) that can be released by a treatment with TEV protease (Tokogi et al., 
2002). These constructs were cotransfected into CHO Lec 3.2.8.1 cells to 
establish stable cell lines. Recombinant integrins were purified from the 



culture supernatonts by on immunoaffinity chromatography using anti- 
coiled-coil antibody 2H 1 1 (Chang et al., 1 994) followed by a gel filtration 
on a Superdex 200 HR column (1 .6 x 60 cm; GE Healthcare) equilibrated 
with 20 mM Tris and 150 mM NaCI, pH 7.5 (TBS), containing 1 mM 
CaCl2 and 1 mM MgCl2. The peak fraction was concentrated to ~1 mg/ml 
and stored at — 80°C until used. Removal of the C-terminal coiled-coil 
clasp and the hexahistidine tog was achieved by treatment with TEV prote- 
ase at room temperature for ^16 h. SG/19 IgG from mouse hybridoma 
cell culture supernatant was purified using a Protein A column (GE Health- 
care). SG/19 Fab fragment was prepared by papain digestion using im- 
mobilized papain (Thermo Fisher Scientific). The DNA sequences for the 
variable region of the SG/1 9 heavy and light chains were determined as 
follows: in brief, total RNA was isolated from the hybridoma cells using the 
SV Total RNA isolation kit (Promega), and the cDNA was amplified by 
RT-PCR reaction using the OneStep RT-PCR kit (QIAGEN) with Ig-Primer 
Sets (EMD). The PCR products were cloned into pDrive vector (QIAGEN) 
and then sequenced. Sequences have been deposited in GenBank/EMBL/ 
DDBJ under accession no. HE578877 (for heavy chain) and accession 
no. HE578878 (for light chain). 

Crystallization and data collection 

Unclasped a5(3l fragment was incubated with a saturating concentration 
of SG/1 9 Fob fragment, and the resultant integrin-Fob complex was puri- 
fied on a Superdex 200 column equilibrated with 50 mM Tris-HCI, pH 7.5, 
150 mM NaCI, 1 mM CaCb, and 1 mM MgCl2. Purified protein was con- 
centrated to 6 mg/ml for crystallization using an Ultrafree-0.5 centrifugal 
concentrator (10-kD molecular mass cutoff; Millipore). Initial screening for 
crystallization conditions was performed using Index (Hampton Research). 
For this screen, a mosquito crystallization robot (TTP LabTech) was used to 
dispense 200 nonoliters of protein solution mixed in a 1:1 ratio with the 
reservoir solution. Drops were equilibrated over 1 00 pi of reservoir solution 
using the sitting drop vapor diffusion method at 293 K. The initial crystalli- 
zation condition (0.1 M bis-tris, pH 6.5, and 20% polyethylene glycol 
monomethyl ether 5000) was further optimized using a 24-well crystalliza- 
tion plate with the hanging drop vapor diffusion method. Each well con- 
tained 500 pi of reservoir solution, and the drop volume was a mixture of 
0.3 pi of protein solution and 0.3 pi of reservoir solution. The reproducibil- 
ity of the crystals was greatly improved by the addition of microseed crys- 
tals in the crystallization drop. Diffraction-quality crystals were obtained 
under a condition of 0.1 M bis-tris, pH 6.5, and 20% polyethylene glycol 
8000 at 293 K and grew to dimensions of ~1 00 x 1 00 x 20 pm within 
2 wk. The RGD peptide complex crystals were prepared by soaking the crys- 
tals of ligond-free form in 1 mM RGD peptide (NH2-GlyArgGlyAspAsnPro- 
COOH] for 30 min at 293 K. 

Before x-ray diffraction experiments, crystals were soaked in reser- 
voir solution containing an additional 20% ethylene glycol and flash cooled in 
liquid nitrogen. X-ray diffraction dotasets for the crystals were collected at 
95 K on beomline BL17A at Photon Factory using a wavelength of 1 .000 A 
and a charge-coupled device detector (Quantum 270; Area Detector Sys- 
tems Corporation). All dotasets were processed and scaled using HKL2000 
program suite (Otwinowski and Minor, 1997). 

Structure determination and refinement 

Initial phase determination was performed by molecular replacement using 
the program MQLREP (Vagin and Teplyakov, 1997) in the CCP4 program 
suite. The atomic coordinates of the aVp3 headpiece (Protein Data Bonk 
accession no. 1 L5G) and a murine Fab fragment (Protein Data Bonk acces- 
sion no. 1 FGN) were used as search models. The orientation and position 
of p-propeller and thigh domain of a5 subunit and pA domain of pi sub- 
unit were initially determined. Subsequently, the positions of variable and 
constant regions of SG/19 Fab fragment were determined. After several 
cycles of rigid body refinement by the program REFAAAC5 (Murshudov et al., 
1 997) of the CCP4 program suite, the hybrid domain of p 1 subunit could 
be introduced. Model reconstruction was conducted manually with Coot 
(Emsley and Cowton, 2004). Crystallogrophic refinement was performed 
using the program REFMAC5. As a final check on the model, the stereo- 
chemical quality was assessed using the program Molprobity (Lovell et al., 
2003). The final models included the following residues: chain A, 1-600; 
chain B, 6-29 and 43-445; chain C, 1-512, 518-553, and 560-600; 
chain D, 5-33 and 43-445; chains E and L, 1-219; and chains F and H, 
1-218. Data collection and refinement statistics are summarized in Table 1. 
All figures were prepared using PyMOL (Delano Scientific). Atomic coordi- 
nates and structural factors have been deposited in the Protein Data Bonk 
under accession no. 3VI3 (unliganded form] and accession no. 3VI4 (RGD 
peptide complex). 
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MD simulation 

All atom MD simulations of the pA domain of a5pl were performed using 
the CHARMM (c35b2] software package (Brooks et ol., 1983). The 
CHARMM27 force field parameter (MocKerell et ol., 1998) with CMAP 
correction (MocKerell et ol., 2004) was used for the protein and the RGD 
ligand. The parameter developed by Bobu and Lim (1999) was used for 
Ca^* and Mg^*. The TIP3P model Porgensen et al., 1983) was used for 
water molecules. The crystal structure of the ligand-free form was used as 
the starting structure without modification, except for the manual placement of 
the RGD peptide at location defined by the RGD complex structure when 
calculating the effect of ligand binding. We used a generalized solvent 
boundary potential (GSBP; Im et ol., 2001) and focused dynamics of pro- 
tein around the metal-binding sites. To set up the system for use with GSBP, 
the region surrounding three metal-binding sites (LIMBS, MIDAS, and 
ADMIDAS) was solvated using a preequilibrated solvent sphere with a radius 
of 20 A centered on Serl 32 of pi . Solvent molecules within 2.6 A of any 
nonhydrogen atom of the protein were removed. The 1 0-picosecond equili- 
bration and the aforementioned solvation process were iteratively repeated 
until the number of newly added solvent molecules became less than five. 
The final equilibrated structure was used for 10 independent MD runs. We 
assigned different initial velocity for each run. MD simulations were per- 
formed using Langevin dynamics at 300 K with a friction constant corre- 
sponding to a relaxation time of 5 picoseconds"' applied to the nonhydrogen 
atoms. The geometry of the water was kept fixed using SHAKE (Ryckoert 
et al., 1 977). A time step of 2 femtoseconds was used. After 500 picosec- 
onds of equilibration (1 00 picoseconds of solvent relaxation, 200 picosec- 
onds of protein relaxation with the fixed MIDAS coordination structure, 
and another 200 picoseconds of full relaxation], we performed 40 nano- 
seconds of MD simulation for each run. The trajectories without showing 
the Ca^* dissociation were used for all analysis, except for analyzing the 
dissociation itself. 

Fibronectin-binding assays 

All recombinant fibronectin fragments encompassing the 7th to 10th Fn3 
repeats (Fn7-10) were produced using a bacterial expression system as 
previously described (Tokogi etal., 2003). In brief, a segment correspond- 
ing to residues 1,142-1,509 of human fibronectin with one Cys residue 
added after residue 1,509 was cloned into pETl Ic vector (EMD) and ex- 
pressed in Escherichia coli. Protein was purified from bacterial lysates by 
onion-exchange chromatography on a HiTrop Q column (GE Healthcare) 
and biotinylated via the sulfhydryl group of the Cys with polyethylene 
oxide-maleimide-activoted biotin (Thermo Fisher Scientific) according to the 
manufacturer's recommendation. Synergy site mutants with triple (Rl 374A/ 
P1376A/R1379A] or single (R1379A) mutations were prepared by Quik- 
Change mutagenesis. To produce full-length a5pl ectodomain fragments, 
293T cells were transiently transfected with vectors encoding a5 (residues 
1-954) and pi (residues 1-708) with the C-terminol coiled-coil, and the 
covolently linked heterodimeric proteins carrying a5 mutations (E8 1 A, E 1 24A, 
or D154A) or the WT sequence were purified from the culture superna- 
tants. Solid-phase binding assay was performed as follows: solutions of 
purified full-length a5pl ectodomoins (5 pg/ml in TBS containing 1 mM 
CaCl2 and 1 mM MgCb) were used to coot 96-well microtiter plates (Moxi- 
Sorp; Thermo Fisher Scientific) by on overnight incubation at 4°C. Coating 
with BSA was used to determine the background values of unspecific bind- 
ing. After a 1-h blocking step (1% BSA in TBS), varying concentrations of 
biotinylated Fn7-1 0 were incubated in the presence of 1 mM Mn'* for 3 h 
at room temperature. After washing, bound Fn7-10 was chromogenically 
detected by peroxidase-streptavidin conjugate and substrate. For the sur- 
face plasmon resonance binding experiments, biotinylated Fn7-10 was 
directly captured on streptovidin-conjugated Sensor Chip SA (GE Health- 
care). Various a5pl integrin heterodimers were used as onalytes at a flow 
rate of 20 pl/min. Between each run, bound integrin was completely 
stripped off from the ligand by regeneration of the surface with 50 mM 
NoOH containing 20 mM EDTA. 

Online supplemental material 

Figs. SI and S2 show the structural superpositions between two a5pl-SG/l 9 
complexes contained in the asymmetric unit (Fig. SI ) and between the ligand- 
free and RGD-bound complexes (Fig. S2]. Fig. S3 shows the detailed depic- 
tions of metal coordination geometries in the MIDAS and ADMIDAS of pi and 
p3 chains. Fig. S4 shows the SI 34(0-y)-Mg^* distance tracings for all trajecto- 
ries during the MD simulations. Videos 1 and 4 present rocking videos show- 
ing the overall structure of the a5pl headpiece-SG/19 Fab complex in the 
absence (Video 1) and presence (Video 4) of RGD ligand. Videos 2 and 3 
present rocking videos of the 2Fo-Fc electron density mop of the ligand- 
free (Video 2) and RGD-bound (Video 3) integrin near the MIDAS region. 



Videos 5-7 show the close-up view videos of metal-binding sites during the 
MD simulation from the trajectory obtained in the -i-RGD simulation (Videos 5 
and 6) or the ligand-free simulation (Video 7). Online supplemental material is 
available at http://v\'ww. jcb.org/cgi/content/full/icb.201 1 1 1077/DCl . 
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